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' coefficients of this sample. In contrast with quadratic forms computed on the Fourier coef- 

^ ' ficients such as the periodogram, the scalogram involves correlations which do not vanish 

■ as the sample size n 00. 
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1. Introduction 

We consider a real-valued process X =^ {Xk}k&, not necessarily stationary and for any 
positive integer fc, let A'^X denote its k-ih. order difference. The first order difference is 
[AX]t =^ Xt — Xt-i and A*^ is defined recursively. 

Definition 1 {M{d) processes). The process X is said to have memory parameter d, d gM. 
(in short, is an M(d) process) and short-range spectral density f* if for any integer k > 
d— 1/2, the k-th order difference process A'^X is weakly stationary with spectral density 
function 

W(A) = |l-e-^^|2('=-'^)r(A) AG(-vr,7r), (1) 

where f* is a non-negative symmetric function which is continous and non-zero at the 
origin. 

M.(d) processes encompass both stationary and non-stationary processes, depending on 
the value of the memory parameter d. The function 

f{X) = \l-e~'Y"'rW (2) 

is called the generalized spectral density of X. It is a proper spectral density function when 
d < 1/2. In this case, the process X is covariance stationary with spectral density function 
/. The process X is said to have long-memory if < d < 1/2, short-memory if d = and 
negative memory if d < 0; the process is not invertible if d < —1/2. The factor /* is a 
nuisance function which determine the "short-range" dependence. 

In a typical semiparametric estimation setting (see for instance 0,0,0]), the following 
additional assumption is often considered. 

Assumption 1. There exists P G (0,2], 7 > and e G (0, vr] such that for all A G [— e,e], 

ir(A)-r(o)i <Lr(o)|A|^ (3) 

Moreover, /*(0) > 0. 

We consider an M[d) process satisfying the following linear assumption. 
Assumption 2. There exists a non-negative integer K such that 

[A^X], = J]aW(fc-t)6, (4) 

where {aW(t), t G 1] is a real-valued sequence satisfying X^j(a'-^Hi))^ < oo and 

(A-1) {£,1,1 G Z} is a sequence of independent and identically distributed real- valued 
random variables such that E[^o] = 0, ]E[,^q] = 1 and K4 *== IE[,^q] — 3 is finite. 
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Here the linear assumption may only apply to a X-order increment of X to allow X to 
be non-stationary. 

Our goal is to estimate d by using a Discrete Wavelet Transform (DWT) of X. In order 
to study the asymptotic properties of the estimator, we use a central limit theorem for an 



array of squares of decimated linear processes, established in [10(], see Theorems 1 and 2 in 
this reference. Using this result, we extend to the non-Gaussian linear processes setting, 
asymptotic normality results for wavelet estimation of the memory parameter d, that have 
been obtained so far for Gaussian processes (see 0, Thoerem 1] and 0, Theorem 5]). We 
treat both the log-regression wavelet estimator and the wavelet Whittle estimator. 

In Section [3l we give a simplified formulation of the central limit theorem [l^, Theorem 2] 
and apply it to the Discrete Wavelet Transform setting, obtaining a result on the asymptotic 
distribution of the scalogram of a linear memory process as the scale index and the number 
of observed wavelet coefficients both tend to infinity, see Theorem [2j We then consider two 
estimators of the memory parameter d, the log-regression wavelet estimator in Section H] 
and the wavelet Whittle estimator in Section [5l Using Theorem [21 we show that both these 
estimators are asymptotically normal. 

2. Definition of the empirical scalogram of a finite sample 

We now introduce the wavelet setting and recall the definition of the scalogram and the 
empirical scalogram. Introduce the functions </)(t), t € M, and ijj{t), i G M, which will play 
the role of the father and mother wavelets respectively, and let ${^) *== Jj^ (/>(t)e~'^* dt and 
''PiO /k 'ip{t)e~^^^ dt denote their Fourier transforms. We suppose that the wavelets (j) 
and satisfy the following assumptions : 

(W-1) (j) and are integrable and have compact supports, 0(0) = J^(j){x)dx = 1 and 
J^i;\x)dx = l. 

(W-2) There exists a> 1 such that sup^g^ IV'(0I (1 + ICD" < oo, 

(W-3) The function ip has M vanishing moments, i.e. f^t^ip{t) dt = for all / = 
0,... ,M- 1 

(W-4) The function X^^g^ k^(/){- — k) is a polynomial of degree I for all / = 0, . . . , M — 1. 
We now define what we call the DWT of X in discrete time. Define the family {ipj^k,j > 
0,k £ of translated and dilated functions 

i;j,k{t) = 2-^/^i,{2-H-k) . (5) 

Using the scaling function (j), we first define the functions 

n 

Xn{t)''^'^Xk<Pit-k) and X{t)''^'^X,(l){t-k) (6) 
k=i fcez 
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The (details) wavelet coefficients are then defined as follows, for all j > 0,k £ 7j, 

Wj,k = / X(t)V'i,fc(t)dt. (7) 

J —OD 

These wavelet coefficients are the DWT of X. If the support of the scaling function (j) is 
included in (— T, 0) for some integer T > 1, then x„(t) = x(t) for alH = 0, . . . , n — T + 1. 
If the support of the wavelet function tp is included in (0,T), then, the support of ipj^k is 
included in the interval {2^k, 2^ {k -\-T)). Hence 



Wj,k= / X„(t)V',-,fc(i)dt , (8) 

J — oo 

for all {j, k) £ In, where 

In = {{j,k): i>0,0<A;<2-^'(n-T + l)-T}. (9) 

For any j, the wavelet coefficients {Wj^k}k€Z are obtained by discrete convolution and 
downsampling. More precisely, under (W-1) , for all j > 0, k £ Z, 

Wj,k = Y.^i h2^k-i = (hj,. * X),,, = [/i,, * X])k, (10) 

where hj^i *== 2"-^/^ 'Pi^ + 0V'(2~-'i) dt, * denotes the convolution of discrete sequences 
and, for any sequence {ck}k£Z, (i"' c)^ = Cf.2j- For all j > 0, Hj{X) =^ X^^g^ denotes 
the discrete Fourier transform of {hj^i}i^z, 

Hj{X) = 2-^/2 r (j){t + l)e~'^^i,{2~H) At. (11) 
For all j > and all m = 0, . . . , M - 1, 

/oo 
i^{2-H) (Pit + i)rdt . 



Under assumption (W-4) , 1 1— > X]/gz 0(^ + 0^™" is a polynomial of degree m and (W-3) there- 
fore implies that X]/ez ^j-i ^™ ~ equivalently, the trigonometric polynomial Hj satisfies 
0, m = 0, . . . , M — 1 and thus admits a zero at of degree at least equal to 



d^"" A=o 

M. Therefore, Hj{X) can be factorized as 

Hj{X) = {l-e'Y'Hj{X), (12) 

where Hj{X) is a trigonometric polynomial. Hence, the wavelet coefficient (llOp may be 
computed as 

Wj,k = a' [h-*^^'x])k (13) 

where {hj i}i^z are the coefficients of the trigonometric polynomial Hj. 
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Let then {(j), ■0} be a pair of scale function and wavelet satisfying (W-1) - (W-4) , Let 
X = {Xfc, A; G Z} be a process such that A.^^ X is weakly stationary and define the DWT 
{^j,k^ i ^ 0, k £ Z} of X by ([71). By (fTlI|) . {Wj^k, A; G Z} is a weakly stationary process 
for all scales j > 0. 

Definition 2. The scalogram of X is the non-negative sequence {cr|, j > 0} of variances 
of {Wj^k, k G Z}, namely 

a]''^'YaT[Wjfl]=K[WlQ], j>0. (14) 

Remark 1. Observe that, under Assumption [2l if M > K, then A.^'^X is a centered weak 
stationary process and the scalogram of X is well defined. 

Wavelet estimators of the memory parameter d are typically based on quadratic forms of 
the wavelet coefficients. This is reasonable because, for large scale j, log cr| is approximately 
an affine function of j with slope (2 log 2) d (see [6|]) and, given n observations Xi, . . . , X^, 
aj can be estimated by the empirical second moment 



fc=0 

which is a quadratic form on the wavelet coefficients. Here we denote by nj the number of 
available wavelet coefficients at scale index j, namely, from Q, 

nj = [2-^{n-T + l) -T+1] , (16) 

where T is the size of the time series and [x] denotes the integer part of x. It is important to 
note that although the wavelet coefficient Wj^k does not depend on n, the empirical second 
moment cj? does through Uj. 



Definition 3. Let {</>, "0} a pair of scale function and wavelet satisfying (W-1)- (W-4) 
and n > 1. Let us denote the maximal scale index J = J{n) by 

'n-T+ 1 



J =^ max{j : nj > 0} 



log2 



T 



(17) 



where nj is defined by U6\). The empirical scalogram of the sample {Xi, . . . , X^} is the 
non-negative process {(t|, j > 0}, where 

• for all j = 1, . . . , J, (t| is defined by [T5\) . 

• and by convention, a| = for j > J. 

The estimator of the memory parameter d can then be obtained as follows : 
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(1) by regressing the logarithm of the empirical variance log(cj?) for a finite number of scale 
indices j S {L, ...,[/} where L is the lower scale and U < J is the upper scale in the 
regression, see Sectional 

(2) by minimizing a contrast derived from the likelihood of an array of independent Gauss- 
ian random variables each row j € {L, . . . , U} of which having empirical variance 
log(<Tj^), see Section [5l 



3. Joint weak convergence of the empirical scalogram of a linear process 



We let — > denote the convergence in law. For convenience, we first state a CLT based 
on results of [10(]. 



Theorem 1. Let {vi j{t), t £ 7^} be real-valued sequences satisfying J^tez '^f j^^^ ^ ^ f^'^ 
i = 1, . . . ,N and j > 0. Suppose that there exist 6 > 1/2, e G (0, vr], a sequence of [— vr, vr)- 
valued functions defined on A € M and continuous functions v*^, i = 1,...,N, 

defined on R such that 



sup sup 2-J'/>*.(A)|(l + 2^ \X\Y < oo , 

j>0 \X\<e 

lim 2-j/2i;*,(2-JA)e'*^(^) = v*^{X) for all A G M , 

j— ►oo 
1/2 



n 



1(|A| > e) Kj(A)p dA ^ as j ^ oo , 



where v* j denotes the Fourier series associated to the sequence {vij{t), t £ Z}, 

<,.(A) = (2vr)-V2^^,^.(i)e-iA*. 

Define {Zij^k, i = I, ■ ■ ■ ,N,j >0,k £ Z} as 

Zi,],k = ^t;»,j(2JA: - t)6 , i = 1, . . . , Af, A; G Z, j > , 



(18) 
(19) 
(20) 

(21) 
(22) 



where {^j, t £ Z} satisfies (A-1). Then, for any diverging sequence (nj), as j — > oo, 



-1/2 



where T is the covariance matrix defined by 



E 

k=0 



>AA(o,r) , 



(23) 



E<°o<',oo(^ + 2pvr) 



dA , l<i,i'<N 



(24) 
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Moreover, one has 

/oo 
|<„,(A)|'dA<oo. (25) 



Proof. Observe first that we allowed e = vr, in which case Condition (j20p is always satisfied 
since the integral vanishes for all j > 0. The CLT (I23p is a strict application of Theorem 1 



m 



10( 1 for e = vr and Theorem 2 in |lo| for e < vr. Using the notations of [lO| we have here 



7j = 2^ for all j > and Aj^oo = for all i = 1, . . . , A^. Observe that in this case, in these 
two theorems, Cj^j/ = 1 and w*^, = v*^v*i ^ for all = 1, . . . ,N. The limit (|25p follows 
from Relation (15) in Proposition 1 in the same paper, see also Remark 11. □ 

Remark 2. When the convergence rate to the limit (j25p is fast enough, we will be able to 

2 

replace the expectations in ([25P by ^i'cx>("^) i = ^, ■ ■ ■ , which does not depend 

on j. 

In the Gaussian case the spectral density of the wavelet coefficients is sufficient for study- 
ing the quadratic forms ()15p since second order properties fully determine the distribution 
of the wavelet coefficients. This is not so in the linear case, which we consider in this paper. 
It is possible, however, to establish a multivariate central limit theorem for the empirical 
scalogram. 

Theorem 2. Let X he an M[d) process with short-range spectral density f* and suppose 



that AssumptionlE holds. Assume that (W-1)- (W-4) hold with 



1/2- a <d<M and K<M. (26) 

Let L = L(n) be a scale index depending on n such that L{n) — > oo and n2~'^("^ —> oo as 
n — > oo. Assume that one of the two following conditions hold. 

sup /*(A) < oo (27) 

Ae(-7r,7r) 

(^2-Mn))l/22L(n){l-2a-2d) aS n ^ OO . (28) 

Then, as n ^ oo, one has the folowing central limit: 

{ V;;^2-2^(")^(ai(„)^, - ai(„)^J, n > 0} ^ {q^\ n > 0} , (29) 

where q['^^ denotes a centered Gaussian process defined on N with covariance function 



^u,u'{(i) = Gov (^Qu\ Qui^j , u,u' > 0, given by 

K,u'{d) 4^ (/*(0))2 2Mw.')+"An' r |D^,|,_,,|(A)|' dA , (30) 

J — TT 
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with, for all u > and A E (— 7r,7r), 



Doo,«(A) = 2^ |A + 2/7r|-2^e„(A + 21tt) ^(A + 2Z7r)^(2-"(A + 2/7r)). 



(31) 



and euiO = 2-"/2[i, g-'^-?, . . . , e-i(2"-i)2-"?]T. 

Remark 3. We assume 1 — 2a — 2d < in (|26p so that (|28p imposes a sufficiently fast 
growtli rate on L{n) as n — > oo. On the other hand this rate has to be slow enough for the 
assumption n2~^^'^^ — > oo to hold. 

Proof. In ([U, the sequence {a^^\t), f € Z} depends on K. To define a quantity which 
does not, we go to the Fourier domain and set 

where the sum over t G Z converges in the sense of L^(— 7r,7r). This function a*{X) satisfies 



|G*(A)|2 = |l-e-"|-2'^r(A) = /(A) 



(32) 



where / is defined in ([2|). Moreover, by (jlSh . since K < M (see Condition (126p ). for all 
J S N and A; G Z, the wavelet coefficients of X can be expressed as 



Since hj . is a finite sequence, we obtain that 



(33) 



where {aj(t), t G Z} is the sequence hj^. -k A*'^ ^{a^^^) which is characterized by the 
L^(— 7r,7r) converging series 

def 



2*{X) (27r)-V2 Y^aj{t)e-'^' = ^,(A)(1 - e-^)^a*(A) , 

tez 



which, in view of (I12p . can be simply written as 

a*iX) = Hj{X)a*{X) . 
To prove the theorem, we need to show that, for any integer i > 0, one has 



(34) 



(35) 



(77 - a} 



"2 2 



^J\f{0,[A^,u'{d), u,u' = 0,...,£\) . (36) 
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To this end, we will apply Theorem [T] by relating the right-hand side of (j36p to the right- 
hand side of (j23p and by expressing the empirical scalogram < n < ^ in terms of 
^i,j,k with adapted indices j, k and i. 

We let j = L+i, that is j is the maximal scale in ()36p . We let k take values k = 0, . . . ,nj, 
where nj, given by (jl6p is the number of wavelet coefficients available at the maximal scale 
j. In Theorem [H Zij^^ is viewed as the i*'' component of a fc-wise stationary vector, with 
i = 1, . . . , N . In order to recover this stationarity from the set of wavelet coefficients used 
to compute the empirical variances in (136p . we do as follows. We represent i as i = 2^~" + v 
where u£{0,...,i} and v e{0,..., 2^"" - 1} and let N = ^iLo 2^"" = 2^+^ - 1. For each 
(u, v), we set i = 2^~" + v, j = L + £, and 

(t) = 2^^'^ai+„(t + ^;2^+"), t G Z , (37) 

where ul+u is defined in (I33p . Thus if we focus on a scale j' > L and express it as 
j' = L + u = j — l + u (see Figure [I]) , we have 

Vi^j{t) = 2-^'^aj,{t + v2^'), t£Z. (38) 

Hence, by definition of Zij^k in ()22|) . one has 

= 2~^'^ J2 aj'{2^'{2^-''k + v}-t) ^ 

= 2-^'^M/., 2.-«fe+, . (39) 

By (HH), 

n^v = 2-(j-^+")(n - T + l)-T + l = 2^"" + (T - 1)(2^-" - 1), 
and hence bv (fTl) . (fTSjl and (l39l). 

2-^^'^ - 4) = n/2-2^'^ ^ (W,H,,, - E[Wl„]) 

k'=0 

v=0 \ k=0 J 

where j' = j — £ + u, k' = 2^~"/c + v, i = 2^~^ + v and 

(T-l){2*-"-l)-l 

Rj'=nj' E {zLn,-^[zlj,n,]} ■ (41) 

v=0 
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We then have 



-2Ld 



where 



A, 



def 



^7^,■— 1 I 



^n,■ — 1 I 



^n,■ — 1 I 



72 







Rl 




+ 


Rl+1 


nzl,,k]} _ 




Rl+i 



2 times 



0... 



...0 



n, 



times 



... n^l^ . . . n^l^ 



0...0 



(42) 



(43) 



0...0 0...0 

is an {1+1) X N matrix. The entries are n-J'^ = ^Z+w where u goes from (top hue) to £ 
(bottom hne), see Figure [H 



L 



u 



J - J 



u 



L + e 



Figure 1. This figure indicates the relationship between the various variables. 

Let us check that the assumptions of Theorem[T]apply to (j37p , that is, we show that (jl8|) , (pO 
and ([19]) are verified for Vij{t) defined by ([37]). Using §8^), ([21]), dS]) and ([35]), we get 

vl.{X) = 2-^'^ e'^^2^' a*, (A) 

= 2-"e'^^2^' Hf{X)a*{X) . 



By continuity of /* at the origin we have sup;^£(_£ ^-j /*(A) < C for some C > and 
e £ (0, vr]. Morever, under ([27j) . we may set e = vr. By ([32]) and ([2]) we get, for all 
A G (-e,e), 

|a*(A)| < |1 - e-'^l""^ VC<VC \X\~'^ . 
By 0, Proposition 3], we have, for all A G (— 7r,7r), 



/7j/(A) - 2^'/^4>{X)TP{2fX) < C 2J''(i/2-")|A|^ 
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and 



< C 2^'/2 |2i'A|^^ (1 + 2^"|A|)-"- 



■M 



where C is a positive constant and a and M are defined in (W-2) and (W-3) , respectively. 



Using that j = L -\- i and j' = L + u for some u only depending on i and L,j' = j + 0{1) 
the last 4 displays and Condition ([26|) easily yield 



(44) 
(45) 



|<j(A)| < C 2^/2 (1 + 2^\X\)-»-'i for ah A e (-e,e) 
|<j(A)|2 dA < C 2^(i-2"-2<i) £ f*{X) dX , 



and the bound 



(/.(2-JA)?/;(2«-^A)a*(2-U) 



= 2-j/2-Ld |a*(2-JA)| i7j/(2-JA) - 2J'/20(2-JA)v5(2«-a) 

< C 2^J'("+*^) lAI*^"*^ , (46) 

valid for 2~-'|A| < e with C denoting some positive constant depending neither on A nor 
on j > 0. Relation (jH]) is ([H]) with 6 = a + d > 1/2. Under ([27]), e = vr and (1201) 
trivially holds (see the proof of Theorem [1]). Otherwise, since (jl6p and n2~^ oo imply 
rij ~ n2^^ = n2"-^~^, Relations (j45p . (j28p and the fact that /* is always integrable away of 



-i\\K—d f* 



/*(A) is a spectral density and |1 — e" 



is lower bounded 



the origin (since 1 1 — e' 

for A away of zero) imply (j20p . By (W-1) , (j) is continuous at the origin where it takes value 
1 and using ([MD, | 

as J — > oo . 



I, (132]), ([2]) and the continuity of /*(A) at A = 0, we have, for all A G M, 
<^(2'JA)2-^^|a*(2-JA)| ^ 2^'^ VfW) IM"^ 
Hence we obtain (fT9]) with 



vl^{X) = 2(«-^)/2+^'^ e^^^2"-^ v7*(0) |A|-^ V^(2«-^A) 
for all i = 2^-" + such that u e {0, ...,£} and v e {0, ... , 2^"" - 1}, and 



(47) 



l«*(A)| ■ 

Since (fT8]) . ([20]) and (fT9]) hold and — > oo, we may apply Theorem [1] and obtain (p3|) . 
Observe that and n2^^ oo imply, for j' = L + u e {L, . . . ,L + £}, 



Uj, = n2-^ + 0(1) 



77,2^"— >oo as n— >oo. 
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Since (n2-^)i/2n^-i ~ (n2-^)i/2(n2-^-")-i = {n2-j)-'^l'^2~^l'^2'' ~ (03]) 
yields, as n — > cxo, 



Vn2-^ A„ ~ n- 2"^/^ Aoo with 



dcf 



2 times 

0... ...0 

times 

0...0 2^^ 0...0 
2^ 0...0 0...0 0...0 



The general term is 2^ for ti G {0, . . . , £}. Relations ([25|) and (jlTj) give that, for j' = L + it G 



Vn2-^E[/2jv] = O ((n2-^)-^/2\ ^ q 



as n — > oo . 



Applying (j23l) . (j42|) . the two last displays and Slutsky's lemma, we get (j36]l with 
A(d) = 2-^ AooFA^ = 2"^' 



1 2*-" -1 



2M+« ^ ^ r2£-„_,_^^2''-"'+t)' 
D=0 d'=0 



0<M,-U'<^ 



where the indices (u, n') run from (0, 0) (top left corner) to (u, u') = (^, ^) (bottom right 
corner) and Fj^j/ is defined by (f24]) with f *^ and defined by (f47l) for i, G 1, . . . , = 
2^"*"^ — 1. To conclude the proof, it remains to check that the entries of A„^„/((i) as defined 
above are equal to those given in (j30p . We shall do that for u' > u since the alternative 
case is obtained by observing that A.u^u'{d) = ^u',u{d)- Replacing Fj j' and then v*^ and 
V*, ^ by these expressions and denoting 



def 



Ap = A + 2pTT, A G M, p G Z 



we get, for < u,u' < i, 

2' 

Au,u'{d) = 2~'+^+^' 4vr(r(0))^ 



v=0 v'=0 



^^iA,(.2"-^-.'2^ -0 |Ap|-2^V'(2«-^Ap)V'(2«'-^Ap) 



dA 



(/*(0))2 4^ 2 



2(?i+M')+^{4d-3) 



t)=0 



/ Gti,n',t,(A) dA , 



(48) 
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where Gu,u',v is a (27r)-periodic function defined by 

„(„2"-'^-u'2"'-*) 



v'=0 



with 



=' |2^-"'A|-2'^V^(2«-«'A)^(A), a G M . 



Writing p = 2^ ^' q + r with g G Z and r G {0, . . . , 2^ — 1} and transforming a sum over 
p into a sum over g and r, we get 



E 



2i—u' 
r=0 



■u'=0 



Y: e-^-'2"'-^/.„,„,.(2'^'-^A.) 



r=0 



where 



Hence 



1 ni-u' 



I 2'--^ -1 2*^"" -1 



Gu,u',vW= E E E e-i2.{.-r'K2"-;,^_^,_^(2"'-^A,)/i„x,.(2'''-^A,0. 

«'=0 r=0 r'=0 

In the last display, observe that v' only appear in the complex exponential argument. 

Moreover we have Ylv'=o e"'^'^^^"^'-''''^" ^ = except for r = r' in which case it equals 
2^-"'. Hence, 

Gu,u',v{^) = 2^ " piM,u',j;(2" ^Xr, 



r=Q 



Applying [l^. Lemma 1] with g = | and 7 = 2^ gives 



G'u,u',t,(A) dA — 2^ " / \hu,u',v{'^^ ^Ar) 



r=0 



dA 



\hu,u',vW\ dA . 
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Inserting this equality in (flHj) and using that (d) = A„/_„((i), we get, for ah < u < 



E 



dA . 



For V G {0, ...,2^-" - 1}, we write v = v' + /fc2"'-" with v' G {0,...,2"'-" - 1} and 
k £ {0, . . . , 2^~" — 1} and transform the sum over v into a sum over and k. But 



since exp |i2-("'-")?;A5} = exp |i2-("'-")?;'Ag} exp{iA:A}, and Ylk=o 



JfcA|2 _ r,e~u' 



we 



obtain 

K,u'{d) = {f*{0)f 47r 24rf«'+«2-("'-") 



X E r El^'?l""'«''"'"'""''''''^(2-(«'-«)A,)^(A,) 
w'=0 •^"'^ qez 

Relation ()30p finally follows by observing that the vector with entries 

2-K-«)/2 |^i2-("'-").'A,^ = 0, . . . , 2"'-- - l} 

is precisely e„/_„(Ag). 



dA . 



□ 



To obtain a result valid for an asymptotically infinite weighted sum of the empirical 
scalogram {o'i+M — n > 0} as in Theorem [3] below, we need a bound for the covariance 
^u,ii' (d) defined in (|30p and a bound for the centered empirical scalogram. The two following 
results provide the bounds. 



Lemma 1. Suppose that if) satisfies (W-1)- (W-3) and let d G (1/2 — a,M]. Then, there 



exists C only depending on d and ip such that, for all u > 0, 

r \Ooo,u{\)? dA < C 2"(i/2-2d) _ 

J —IT 

Proof. See Relation (72) in An alternative is to use that Ao,u{d) = Gov ) 
and thus the Cauchy-Schwarz Inequality yields |Ao,M((i)P < |Ao,o('^)||Au^„((i)|. Using ([30]) 
we get, setting /*(0) = 1 

4712^"^" r |Doo,«(A)|2 dA < 4^2(2'^+V2)« T |d^,o(A)|' dA . 

J —TV J —TV 

The results follows from the fact that 1000,0(^^)1 is bounded for d G (1/2 — a,M] un 



der (W-1) - (W-3) , see Remark 1 in 



□ 
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Lemma 2. Let X he an M{d) process with short-range spectral density f* and suppose that 
Assumption holds. Assume that (W-1)- (W-4) hold with Condition i26\) on M and a. 
Then, there exists a positive constant C such that, for all n > 1 and j £ {0, 1, . . . , J}, 

E [|<t| - a]\] < C {2(V2+2db- ^-1/2 ^ 2J-(i-2a)| (49) 



If moreover Condition [27\ j holds on /*, one has 



(50) 



Proof. We use the same notations as in Theorem [2] to express (t| in terms of a decimated 
hnear process, but since here only one scale needs to be considered, we take L = j = j' 
(hence u = i = and i = I). In this case (j40]l reads as 



-2i(i/2.2 ^2^1 



(51) 



k=0 



where Zij^k = Z^tez ^ij "^^^^ = 2 ^'^aj{t). If ([27|) holds, then uij satisfies ([T8] 

with e = IT and (jl9p (see the proof of Theorem [2]) and, by Lemmas 5 and 6 in [lO(], we get 



supVar j Uj ^ Zf j ,^ j < oo . 



(52) 



k=0 



Since Uj x n2-^ for j G {0, 1, . . . , J}, ([50]) follows. 

If (|27p does not hold, vij satisfies (|18p for some e > which may no longer be taken equal 
to vr (as a consequence of (I44p in the proof of Theorem [2]) and, applying [id . Proposition 4] 
with Xij = 0, we get 



2-2irf(^2 _ ^2) ^ ,^-1/2 



fc=0 



where .Z'l j ^ satisfies (|18p with e = vr and (|19p and hence 
positive constant C not depending on j, 



E[\R,\]<C 



1/2 1/2 

n- ij + J^. 



and Rj satisfies, for some 



(53) 



where 



del 



l(|A-Ai,oo| >e) \vlAX)Y dA 



(54) 



Since /* is always integrable away of the origin, the bound (j45p implies (recall that here 
L = j) 

\vlJX)\^ dA < c 2(^-2"-2'i)J . 
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Hence, we obtain, for some constant C not depending on j nor n. 



E 



0-27d|i.2 ^2 1 

2 \aj -aj\ 



<Cn] 



'1/2 



<Cn 



'1/2 



J 



1 + ^l/22(l-2Q-2d)i _^ 2(l-2"-2rf)i/2' 



2 + „l/22(l-2a-2d)i 



where we used 1 — 2a — 2d < in Condition p6|) . Since rij >c n2 Relation (|^9|) follows. □ 
We now prove the main result of this section. 

Theorem 3. Let {wu{n-),n,u > 0} be an array of real numbers such that Wu{n) — > Wu for 
all u > as n ^ oo and 



lim y sup|u;„(n)|2(^/2+2d)« ^ q _ 



M>£ — 

Then, under the assumptions of Theorem\^ as n ^ oo, 

J-L 



(55) 



(56) 



M=0 



?i,«'>0 



where J is defined in (11). 
Proof. We denote, for all ^ > 0, 

I 

and 

e 



u=0 

Theorem [2] then gives that, for any £ > 0, as n ^ oo, 

{d)Wu' 

\ o<u,u'<e 

Note that (f55|) implies 12(^/^+2'')" ^ as £ ^ oo, hence, using LemmaHJand ([30 

we have 

Y \^uAu,u'id)wu\<C |«^.«^.|2^'^"'+"2("'-")(V2-2d) 

l<u,u' i<u<u' 



(l/2+2d)u 



< C ^h„|2 

— > as £ — > oo. 
The left-hand side of ([56]) is Sn,j~L- We decompose it as 



Yj I"^"'!^ 



(l/2+2d)u' 



Sn,J-L = [Sn,J-L - Sni] + 
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From the last 3 displays and applying [l|, Theorem 3.2], it is sufficient to prove that 



lim limsupE 



15. 



n,J—L 



Sn.el + 



5, 



Sn 







(57) 



To obtain this limit, we need to separate the case where Condition (j27p holds from the one 
where it is replaced by Condition ([28]). Under Condition ([27|) . we apply ([50|) : under ([28]) . 
we apply (j49p . Let us for instance check the second case (the first one is similar, although 
simpler). The bound (l49]l implies 



E 



(Si, 



M=0 



which, using Wu — > Wu{n) and (p8]) . tends to as n — > cx) for all ^ > 0, and 

¥.\\Sn,J-L-Snl\] 



< c 



\wu{n)\2^'/^+^^> + 2^(1-2-2'^) |«;„(n)|2"(i-2") 



< C 



1 + 2^(l-2a-2d)^;^J ^ |^„^(„)|2(V2+2d)« ^ 



(since 1 — 2a < 2(i in Condition (p6]) ) which tends to as n ^ oo followed by £ ^ cxd by ([28]) 
and ()55p . This yields ()57p . which achieves the proof. □ 



4. The log-regression estimation of the memory parameter 

The wavelet-based regression estimator of the memory parameter d involves regressing 
the scale spectrum estimator (t|, defined in (jlSp . with respect to the scale index j. More 
precisely, an estimator of the memory parameter d is obtained by regressing the logarithm 
of the empirical variance log{af) for a finite number of scale indices j £ {L, . . . ,L + i} where 
L = L{n) > is the lower scale and l + l > 2 is the number of scales used in the regression. 
For a sample size equal to n, this estimator is well defined for L and i such that i > 1 and 



L + £< [log2(n-T + l)-log2(T)] , 



(58) 



where the right-hand side of this inequality is the maximal index j such that rij > 1. The 
regression estimator can be expressed formally as 

L+e 

dn{L,w) = ^ log (<t|) , (59) 



where the vector w 



dcf 



[Wo, . 



Wi] of weights satisfies 



Wi = and 2 log(2) ^ iwi = 1 . 



(60) 



i=0 



i=0 
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One may choose, for example, w corresponding to the weighted least-squares regression 
vector, defined by 

w = DB{B'^DB)-^h , 



where B 
and 



dcf 



1 1 

2 



1 



is the so-called design matrix, is a definite positive matrix 



dcf 



b = [0(21og(2))-if. (61) 
Ordinary least square regression corresponds to the case where D is the identity matrix. 



In [6|], the process X was assumed Gaussian and a bound for the mean square error 
and an asymptotic equivalent to the variance of (i„(-L,w) were obtained. The asymptotic 
normality is established in also under the Gaussian assumption. Here we show that the 
asymptotic normality holds under the weaker linear assumption. 



Theorem 4. Let X be an M{d) process with short-range spectral density f* and suppose 

(62) 



that Assumptions^ andl^hold. Under (W-1)- (W-4)with 

{l + P)/2-a<d<M and K<M, 
if, as n ^ oo, L{n) is such that 

(n2-^W)"l+n2-(l+2/3)L(n)^Q^ 

then one has the following central limit: 

Vn2-^(") (^dn{L,w) - d^ (0, w^V(d)w) 



where 



Kid) 



dcf 



\^\-'"'\m\''dc 



and V(d, ^) is the {1 + £) x (1 + £) matrix defined as 



dcf 47r22'^IJ-*l2^^J 
K(^i)2 



|Doo,|j-i|(A)| dA 



<i,j < 



(63) 

(64) 
(65) 

(66) 



Proof. The only assumptions of Theorem [2] that are not included in our set of assumptions 
here are ()27p and (|28p. The former is verified if e = vr in Assumption[TJ If e < vr, (j28p holds as 
a consequence of ()62p and (j63p . Hence Theorem [2] applies (note that (j62p implies (j26p since 
/3 > 0). Applying [7|, Theorem 1], under (W-1) - (W-4) , we have the following approximation: 

(j| - /* (0) K(d) 2^^'^ <Cf* (0) L 2(2'^-'5)j' (67) 

where^a^ is defined in and K{d) in This, with Theorem [21 0, Relation (39)] 

□ 



and [^, Proposition 3], gives the result. 



ASYMPTOTIC NORMALITY OF WAVELET ESTIMATORS 



19 



Remark 4. The centering in (j29p in Theorem [2] involved the expected value whereas the 
centering in (j64p in Theorem H] does not. In order to deal with the corresponding bias, 
Condition (j26p is strengthened by 



5. The wavelet Whittle estimation of the memory parameter 



We now consider the semi-parametric estimator introduced in [7[. As the log-regression 
estimation, this estimator is also based on the scalogram but is defined as the maximizer of 
a Whittle type contrast function (see 3, Eq. (20)]), 

log {Y.2-'^'^n,a]] +2d'log(2)J 



dn{L, U) *== Argmin 

d'gM 



\j=L 



with J 



def 2^j=L3 "-i 



Eu 
i=L 



dn, which involves the scales L < j < U, is a wavelet analog of the local Whittle Fourier 
estimator studied in 0] (often referred to as semiparametric Gaussian estimator) and is 
therefore called the local Whittle wavelet estimator. To prove the asymptotic normality of 
dn, we will use Theorem [31 

We denote, for all integer £ > 1, 



Ve 



dcf 



E. 

j=0 



2 -J 



p\d,i) 



dcf 



2-2- 



TT 



and Ki = ^(j 



2-J 



j=0 



2-2- 



(68) 



(2 - 2-0«Klog(2)K(d))2 



p^{d,oo) 



u=l 
dcf TT 



i=0 



2-2- 



j{i - rje){i + u - r]e) 



[21og(2)K((i)]2 



Io(d) + 2f;i„(d)2(2'^~i)«l , 



(69) 
(70) 



u=l 



where K{d) is defined in ([65]) . 

Theorem 5. Let X he an M{d) process with short-range spectral density f* and suppose 
that Assumptions[l\ and\^hold. Under (W-1) - (W-4) with Condition [6^1 on a and M, if, 
as n ^ oo, the lower scale L{n) is such that 

L(n)(n2-^("))^V8 + ^2~^i+2p)Lin) ^ q ^ (7^) 

and the upper scale U{n) is such that 

U{n) - L{n) {l,2...,oo}, 
then one has the following central limit: 

\/n2-^(") ((i„(L(n),[/(n)) -d) ^M{0,p\dj)) , (72) 



20 F. ROUEFF AND M.S. TAQQU 

where p'^{d,t) for £ < oo and i = oo are defined in i69\) and { 70) respectively. 



Remark 5. Condition (j7ip is similar to (j63|) but replaced by the stronger 

condition L{n)(n2~^^^'^)~^^^ 0, which holds for example, if n2~^^"''^ /rC has a positive 
limit for some 7 > as usually verified. 

Proof. Assume /*(0) = 1 without loss of generality. The proof is the same as that of 0, 
Theorem 5] until Eq. (66), 

(n2-^)-i/2s„ 



(n2-^)i/2 {dn - d) 



2 1og(2)K(d) {2-2-iU-L))Ku-L 



(l + op(l)) , 



(73) 



where 



u 



Sn = E[J■-^]2-2^■'^A'• 
Thus, using ^^^[j — J] rij = and (fTH) . we get 

u 

^[Sn] = Y.^j-J]n, (2-2^'^a|-r(0)K(ff)) =0(n2-(i+^)^) = o((n2-^)i/2 



where the O-term follows from ([67|) . the fact that L < J < L + 1 (see 0, Eq (61)]) and 
rij < n2~^ and the o-term follows from (jTip . 

Hence it only remains to establish a CLT for S„, similar to that of 0, Proposition 10] but 
under the assumptions of Theorem [5l This is obtained by observing that 



U-L 



(n2-^)-i/2s„ = (n2-^)i/2 2-2" yj^{n)a^ 



ji=0 



with u = j — L and 



Wuin) = [u-{J-L)]2-''-''^, uG{0,...,U-L} , 

which satisfies 

sup \wu{n)\ < C ?x2-(2'^+^)", ue {0,...,U - L} , 

n 

and by applying Theorem [3l 

Appendix A. Wavelet coefficients linear filters 



□ 



Assumption (W-1) implies that (p and il^ are everywhere infinitely differentiable. When (W-1) 



holds, Assumptions (W-3) and (W-4) can be expressed in different ways. (W-3) is equivalent 



to asserting that the first M — 1 derivative of ip vanish at the origin and hence 

|^(A)| =O(|A|^-0 as A^O. (74) 
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And, by Theorem 2.8.1, Page 90], (W-4) is equivalent to 



O(|A|^0 as A^O. (75) 
Many authors suppose that the ipj^k are orthogonal and even that they are generated by 



sup|0(A + 2kiT) 



a multiresolution analysis (MRA). Assumptions (W-l) -(W-4) in Section [3] are satisfied in 
these cases, (j) being the scaling function and V is the associated wavelet. In this paper, 
however, we do not assume that wavelets are orthonormal nor that they are associated to a 
multiresolution analysis. We may therefore work with other convenient choices for (p and "0 



as long as (W-1) - (W-4) are satisfied. A simple example is to set, for some positive integer 



<^(x)1i^lfjj(x) 



and ip{x) =^ Cn ^ 



N 



dx^ [O'll 

where 1^ is the indicator function of the set A, denotes the A^-th self-convolution of 
a function / and Cjy is a normalizing constant such that J^^^p'^{x)dx = 1. It follows that 



|,/.(OI = |2sin(C/2)/^r and |^(e)| = C;v 1^1^' |2 sin(e/2)/e| 



TV I 



\2N 



Using and ([75]) . one easily checks that (W-1) - (W-4) are satisfied with M and a equal 
to N. Of course the family of functions {ipj^k} are not orthonormal for this choice of the 
wavelet function ip (and the function tp is not associated to a MRA). 
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